A Modular Q-Learning Architecture for Manipulator Task Decomposition
نویسندگان
چکیده
Compositional Q-Learning (CQ-L) (Singh 1992) is a modular approach to learning to perform composite tasks made up of several elemental tasks by reinforcement learning. Skills acquired while performing elemental tasks are also applied to solve composite tasks. Individual skills compete for the right to act and only winning skills are included in the decomposition of the composite task. We extend the original CQ-L concept in two ways: (1) a more general reward function, and (2) the agent can have more than one actuator. We use the CQ-L architecture to acquire skills for performing composite tasks with a simulated twolinked manipulator having large state and action spaces. The manipulator is a non-linear dynamical system and we require its end-effector to be at specific positions in the workspace. Fast function approximation in each of the Q-modules is achieved through the use of an array of Cerebellar Model Articulation Controller (CMAC) (Albus 1975) structures.
منابع مشابه
A modular learning architecture for orienting a robot in a visual servoing task
A robust modular neural architecture is developed for the position/orientation control of a robot manipulator with visual feedback. Modular learning enhances the neural networks capabilities to learn and approximate complex problems. The proposed bidirectional modular learning architecture avoids the neural networks wellknown limitations. Simulation results on a 4 degrees of freedom robot are r...
متن کاملTask Decompostiion Through Competition in a Modular Connectionist Architecture: The What and Where Vision Tasks
A novel modular connectionist architecture is presented in which the networks composing the architecture compete to learn the training potterns. An outcome of the competition is that different networks learn different training patterns and, thus, learn to compute different functions. The architecture performs task decomposition in the sense that it learns to partition a task into two or more fu...
متن کاملOpen Modular Robot Control Architecture for Assembly Using the Task Frame Formalism
The task frame formalism allows the programmer to overcome the drawbacks of the traditional robot oriented assembly programming, moving the programmer’s focus on the robot task. Additionally skill primitives contribute to a more natural programming paradigm. In this paper a robot control architecture is presented that implements both of these concepts providing a framework to easily implement n...
متن کاملA Competitive Modular Connectionist Architecture
We describe a multi-network, or modular, connectionist architecture that captures that fact that many tasks have structure at a level of granularity intermediate to that assumed by local and global function approximation schemes. The main innovation of the architecture is that it combines associative and competitive learning in order to learn task decompositions. A task decomposition is discove...
متن کاملTask decomposition through competition in a modular connectionist architecture: The what and where vision tasks
A novel modular connectionist architecture is presented in which the networks composing the architecture compete to learn the training patterns. An outcome of the competition is that di erent networks learn di erent training patterns and, thus, learn to compute di erent functions. The architecture performs task decomposition in the sense that it learns to partition a task into two or more funct...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1994